Audio segmentation using Flattened Local Trimmed Range for ecological acoustic space analysis

Authors

  • Giovany Vega
  • Carlos J. Corrada-Bravo
  • T. Mitchell Aide
Abstract

The acoustic space in a given environment is filled with footprints arising from three processes: biophony, geophony and anthrophony. Bioacoustic research using passive acoustic sensors can result in thousands of recordings. An important component of processing these recordings is to automate signal detection. In this paper, we describe a new spectrogram-based approach for extracting individual audio events. Spectrogram-based audio event detection (AED) relies on separating the spectrogram into background (i.e., noise) and foreground (i.e., signal) classes using a threshold such as a global threshold, a per-band threshold, or one given by a classifier. These methods are either too sensitive to noise, designed for an individual species, or require prior training data. Our goal is to develop an algorithm that is not sensitive to noise, does not need any prior training data and works with any type of audio event. To do this, we propose: (1) a spectrogram filtering method, the Flattened Local Trimmed Range (FLTR) method, which models the spectrogram as a mixture of stationary and non-stationary energy processes and mitigates the effect of the stationary processes, and (2) an unsupervised algorithm that uses the filter to detect audio events. We measured the performance of the algorithm using a set of six thoroughly validated audio recordings and obtained a sensitivity of 94% and a positive predictive value of 89%. These sensitivity and positive predictive values are very high, given that the validated recordings are diverse and obtained from field conditions. The algorithm was then used to extract audio events in three datasets. Features of these audio events were plotted and showed the unique aspects of the three acoustic communities.

Subjects: Bioinformatics, Computational Biology
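
The abstract contrasts global and per-band thresholds and reports sensitivity and positive predictive value. The sketch below illustrates those baseline ideas only; it is not the FLTR method, whose details are not given here. The toy spectrogram, the ground-truth mask, the "median plus offset" per-band rule, and all function names are assumptions chosen for illustration.

```python
from statistics import median


def global_threshold_mask(spec, threshold):
    """Mark a cell as foreground when it exceeds one threshold for all bands."""
    return [[value > threshold for value in band] for band in spec]


def per_band_threshold_mask(spec, offset):
    """Mark a cell as foreground when it exceeds its band's median plus an offset.

    The median-plus-offset rule is an assumed, simple per-band threshold.
    """
    mask = []
    for band in spec:
        band_threshold = median(band) + offset
        mask.append([value > band_threshold for value in band])
    return mask


def count_outcomes(mask, truth):
    """Count true positives, false positives and false negatives cell by cell."""
    tp = fp = fn = 0
    for mask_band, truth_band in zip(mask, truth):
        for detected, actual in zip(mask_band, truth_band):
            if detected and actual:
                tp += 1
            elif detected:
                fp += 1
            elif actual:
                fn += 1
    return tp, fp, fn


def sensitivity(tp, fn):
    """Fraction of real events that were detected: TP / (TP + FN)."""
    return tp / (tp + fn)


def positive_predictive_value(tp, fp):
    """Fraction of detections that are real events: TP / (TP + FP)."""
    return tp / (tp + fp)


# Hypothetical spectrogram: rows are frequency bands, columns are time frames.
toy_spec = [
    [10, 10, 30, 10, 10],  # quiet band with a single event at frame 2
    [40, 40, 40, 40, 40],  # band with constant (stationary) noise, no event
]
# Hypothetical ground truth: only the frame-2 cell of the first band is signal.
toy_truth = [
    [False, False, True, False, False],
    [False, False, False, False, False],
]

global_mask = global_threshold_mask(toy_spec, threshold=25)
band_mask = per_band_threshold_mask(toy_spec, offset=10)

global_tp, global_fp, global_fn = count_outcomes(global_mask, toy_truth)
band_tp, band_fp, band_fn = count_outcomes(band_mask, toy_truth)

print("global  :", sensitivity(global_tp, global_fn),
      positive_predictive_value(global_tp, global_fp))
print("per-band:", sensitivity(band_tp, band_fn),
      positive_predictive_value(band_tp, band_fp))
```

In this toy case both thresholds find the event, but the global threshold also flags every cell of the constantly noisy band, which lowers its positive predictive value. This is the kind of sensitivity to stationary noise that the abstract says FLTR is meant to mitigate.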

Similar resources

Acoustic quality assessment at Nezamol molk dome of Jame mosque of Isfahan

 Incontrovertibly, the sense of hearing is one of the five most substantial human senses. In fact, the human ear receives sound and transmits to the human brain by the auditory organs. Hence, sound can be considered as one of the key tools of human communication with each other and the environment around them. Since acoustic has a profound impact on the body, soul, and the performance of human ...

Green Space Suitability Analysis Using Evolutionary Algorithm and Weighted Linear Combination (WLC) Method

With current new urban developments, no balance can be found between green spaces and open areas present within urban networks and natural land patterns since urban networks are dominating ecological networks. Accordingly, one of the major tasks of urban and regional planners is the optimal land use allocation to urban green spaces. Therefore, to achieve this goal in this research, locations of...

On Building and Evaluating a Broadcast-News Audio Segmentation System

Audio segmentation is useful in diverse applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Also, an initial audio segmentation stage may help to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this paper, firstly, the Albayzín-2010 audio segmentation evaluation is reported, including some co...

Evaluation of Biophonies in Isfahan Parks, Using Acoustic Indices

Soundscape analysis, using acoustic indices, provides researchers with valuable ecological information to assess biodiversity, species behavior, and noise pollution. In this study, to analyze biophonies and anthrophony in six urban parks in Isfahan city in spring, six acoustic indices, including Acoustic complexity index (ACI), Acoustic Diversity Index (ADI), Acoustic Evenness Index (AEI), an...

Speech and Crosstalk Detection in Multi-Channel Audio

The analysis of scenarios in which a number of microphones record the activity of speakers, such as in a round-table meeting, presents a number of computational challenges. For example, if each participant wears a microphone, it can receive speech from both the microphone's wearer (local speech) and from other participants (crosstalk). The recorded audio can be broadly classified in four ways...

Journal title:
  • PeerJ Computer Science

Volume 2

Publication date: 2016